255 results found.
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
Mandarin Chinese
Availability:
Freely Available
License:
Apache License v.2.0
Size:
15 GByte
Production Status:
Existing-used
Use:
Speech Recognition/Understanding
- Paper title: Multi-mode Transformer Transducer with Stochastic Future Context
- Paper track: 8.6 Neural network training methods (including new/Poster Presentation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Kwangyoun Kim | AISHELL-1 | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Bilingual
Languages:
English Mandarin Chinese
Availability:
From Data Center(s)
License:
Size:
1000 hours
Production Status:
Existing-used
Use:
Speech Recognition/Understanding
- Paper title: Raw Waveform Encoder with Multi-Scale Globally Attentive Locally Recurrent Networks for End-to-End Speech Recognition
- Paper track: 8.1 Feature extraction and low-level feature model/Oral Presentation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Max W. Y. Lam | AISHELL-2 | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Multilingual
Languages:
Arabic Bengali Dari English German Hindi Iranian Persian Japanese Korean Mandarin Chinese Persian Russian Spanish Standard Arabic Tamil Thai Vietnamese Yue Chinese
Availability:
From Owner
License:
LDC
Size:
66 hours
Production Status:
Existing-used
Use:
Language Identification
- Paper title: Modeling and training strategies for language recognition systems
- Paper track: 4.1 Language identification and verification, lang/Oral Presentation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Raphaël Duroselle | 2007 NIST Language Recognition Evaluation Test Set | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Multilingual
Languages:
Amharic Bosnian Croatian Dari English French Georgian Haitian Hausa Hindi Korean Mandarin Chinese Persian Portuguese Pushto Russian Spanish Turkish Ukrainian Urdu Vietnamese Yue Chinese
Availability:
From Owner
License:
LDC
Size:
215 hours
Production Status:
Existing-used
Use:
Language Identification
- Paper title: Modeling and training strategies for language recognition systems
- Paper track: 4.1 Language identification and verification, lang/Oral Presentation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Raphaël Duroselle | 2009 NIST Language Recognition Evaluation Test Set | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Multilingual
Languages:
Arabic Bengali Dari Egyptian Arabic English Georgian Hindi Iranian Persian Italian Japanese Khmer Korean Lao Mandarin Chinese Min Nan Chinese Moroccan Arabic Panjabi Persian Russian Spanish Tagalog Thai Tigrinya Urdu
Availability:
From Owner
License:
LDC
Size:
640 hours
Production Status:
Existing-used
Use:
Language Identification
- Paper title: Modeling and training strategies for language recognition systems
- Paper track: 4.1 Language identification and verification, lang/Oral Presentation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Raphaël Duroselle | 2008 NIST Speaker Recognition Evaluation | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
Arabic Bengali Dari Egyptian Arabic English Georgian Hindi Iranian Persian Italian Japanese Khmer Korean Lao Mandarin Chinese Min Nan Chinese Moroccan Arabic Panjabi Persian Russian Spanish Tagalog Thai Tigrinya Urdu
Availability:
From Owner
License:
LDC
Size:
950 hours
Production Status:
Existing-updated
Use:
Language Identification
- Paper title: Modeling and training strategies for language recognition systems
- Paper track: 4.1 Language identification and verification, lang/Oral Presentation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Raphaël Duroselle | 2008 NIST Speaker Recognition Evaluation Training Set Part 2 | /N |
Documentation:
None
Speech
Evaluation Data,
Language Type:
Multilingual
Languages:
English French Mandarin Chinese
Availability:
From Owner
License:
Size:
4217 minutes
Production Status:
Existing-used
Use:
Speech Recognition/Understanding
- Paper title: Speaker Adversarial Training of DPGMM-based Feature Extractor for Zero-Resource Languages
- Paper track: 10.8 Zero-resource speech recognition/Oral Presentation
- Paper status: Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yosuke Higuchi | ZeroSpeech 2017 | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
Mandarin Chinese
Availability:
Freely Available
License:
Apache License v.2.0
Size:
47897192 KByte
Production Status:
Existing-used
Use:
Speech Recognition/Understanding
- Paper title: Extract, Adapt and Recognize: an End-to-end Neural Network for Corrupted Monaural Speech Recognition
- Paper track: 8.5 Novel neural network architectures (e.g. seque/Oral Presentation
- Paper status: Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jun Wang | AISHELL-1 | /N |
Documentation:
lexicon, speaker info
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
Mandarin Chinese
Availability:
Freely Available
License:
LDC
Size:
178 hours
Production Status:
Existing-used
Use:
Speech Recognition/Understanding
- Paper title: Framewise Supervised Training towards End-to-End Speech Recognition Models: First Results
- Paper track: 8.7 Discriminative acoustic training methods for A/Oral Presentation
- Paper status: Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Mohan Li | AISHELL-1 | /N |
Documentation:
LDC2018S14 Documents
Written
Corpus,
Language Type:
Monolingual
Languages:
Mandarin Chinese
Availability:
Not Applicable
License:
Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Size:
100 MByte
Production Status:
Newly created-in progress
Use:
Summarisation
- Paper title: Summarizing Medical Conversations via Identifying Important Utterances
- Paper track: Long paper/
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yuanhe Tian | Chinese Medical Conversation Corpus (CMC) | /N |
Documentation:
None




